Picture for Ziming Li

Ziming Li

Counterfactual Graph for Multi-Agent LLM Calibration

Add code
May 28, 2026
Viaarxiv icon

Reinforcement Learning with Robust Rubric Rewards

Add code
May 28, 2026
Viaarxiv icon

On the Safety of Graph Representation Learning

Add code
May 07, 2026
Viaarxiv icon

Kwai Summary Attention Technical Report

Add code
Apr 27, 2026
Viaarxiv icon

Visual Preference Optimization with Rubric Rewards

Add code
Apr 14, 2026
Viaarxiv icon

Not All Tokens See Equally: Perception-Grounded Policy Optimization for Large Vision-Language Models

Add code
Apr 02, 2026
Viaarxiv icon

Kelix Technical Report

Add code
Feb 12, 2026
Viaarxiv icon

Graph is a Substrate Across Data Modalities

Add code
Jan 29, 2026
Viaarxiv icon

Scaling Rough Terrain Locomotion with Automatic Curriculum Reinforcement Learning

Add code
Jan 24, 2026
Viaarxiv icon

SIN-Bench: Tracing Native Evidence Chains in Long-Context Multimodal Scientific Interleaved Literature

Add code
Jan 15, 2026
Viaarxiv icon